Dataset statistics
| Number of variables | 11 |
|---|---|
| Number of observations | 139 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 6.6 KiB |
| Average record size in memory | 48.9 B |
Variable types
| NUM | 10 |
|---|---|
| BOOL | 1 |
Reproduction
| Analysis started | 2020-08-11 23:01:58.744254 |
|---|---|
| Analysis finished | 2020-08-11 23:02:10.503805 |
| Duration | 11.76 seconds |
| Version | pandas-profiling v2.8.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
| Warning | Type |
|---|---|
| growth_rate has constant value "0" | Constant |
| tests_positive is highly correlated with df_index and 4 other fields | High correlation |
| df_index is highly correlated with tests_positive and 4 other fields | High correlation |
| tests_negative is highly correlated with df_index and 4 other fields | High correlation |
| tests is highly correlated with df_index and 4 other fields | High correlation |
| patients_hosp is highly correlated with patients_icu | High correlation |
| patients_icu is highly correlated with patients_hosp | High correlation |
| recovered is highly correlated with df_index and 4 other fields | High correlation |
| rolling_ave is highly correlated with df_index and 4 other fields | High correlation |
| df_index has unique values | Unique |
| tests_positive has unique values | Unique |
| tests_negative has unique values | Unique |
| tests has unique values | Unique |
| rolling_ave has unique values | Unique |
| tests_pending has 117 (84.2%) zeros | Zeros |
| patients_icu has 18 (12.9%) zeros | Zeros |
| patients_hosp has 13 (9.4%) zeros | Zeros |
| patients_vent has 29 (20.9%) zeros | Zeros |
| recovered has 13 (9.4%) zeros | Zeros |
| Distinct count | 139 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 75.0 |
|---|---|
| Minimum | 6 |
| Maximum | 144 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.1 KiB |
Quantile statistics
| Minimum | 6 |
|---|---|
| 5-th percentile | 12.9 |
| Q1 | 40.5 |
| median | 75 |
| Q3 | 109.5 |
| 95-th percentile | 137.1 |
| Maximum | 144 |
| Range | 138 |
| Interquartile range (IQR) | 69 |
Descriptive statistics
| Standard deviation | 40.26992261 |
|---|---|
| Coefficient of variation (CV) | 0.5369323014 |
| Kurtosis | -1.2 |
| Mean | 75 |
| Median Absolute Deviation (MAD) | 35 |
| Skewness | 0 |
| Sum | 10425 |
| Variance | 1621.666667 |
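The figures in the table above can be checked by hand. The sketch below assumes this variable is df_index: it has 139 distinct values with a minimum of 6 and a maximum of 144, so it must contain every integer in that range. It also assumes the report uses the sample (n-1) variance, which is what reproduces the reported value; the variable names are illustrative only.

```python
import statistics

# Assumption: the column holds every integer from 6 to 144 (139 distinct values).
values = list(range(6, 145))

n = len(values)
total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance, divides by n - 1
std = statistics.stdev(values)          # square root of the sample variance
cv = std / mean                         # coefficient of variation
median = statistics.median(values)
# Median absolute deviation: median of |value - median|
mad = statistics.median([abs(v - median) for v in values])

print(n, total, mean, median, mad)
print(round(variance, 6), round(std, 8), round(cv, 10))
```

If the population variance (dividing by n) were used instead, the result would be 1610 rather than the 1621.666667 shown in the table.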
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 144 | 1 | 0.7% | |
| 49 | 1 | 0.7% | |
| 55 | 1 | 0.7% | |
| 54 | 1 | 0.7% | |
| 53 | 1 | 0.7% | |
| 52 | 1 | 0.7% | |
| 51 | 1 | 0.7% | |
| 50 | 1 | 0.7% | |
| 48 | 1 | 0.7% | |
| 40 | 1 | 0.7% | |
| Other values (129) | 129 | 92.8% |
| Value | Count | Frequency (%) | |
| 6 | 1 | 0.7% | |
| 7 | 1 | 0.7% | |
| 8 | 1 | 0.7% | |
| 9 | 1 | 0.7% | |
| 10 | 1 | 0.7% |
| Value | Count | Frequency (%) | |
| 144 | 1 | 0.7% | |
| 143 | 1 | 0.7% | |
| 142 | 1 | 0.7% | |
| 141 | 1 | 0.7% | |
| 140 | 1 | 0.7% |
| Distinct count | 139 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 64697.467625899284 |
|---|---|
| Minimum | 71 |
| Maximum | 178009 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 556.0 B |
Quantile statistics
| Minimum | 71 |
|---|---|
| 5-th percentile | 402.5 |
| Q1 | 12634 |
| median | 62266 |
| Q3 | 101105 |
| 95-th percentile | 158564.2 |
| Maximum | 178009 |
| Range | 177938 |
| Interquartile range (IQR) | 88471 |
Descriptive statistics
| Standard deviation | 52409.38191 |
|---|---|
| Coefficient of variation (CV) | 0.810068521 |
| Kurtosis | -0.947107105 |
| Mean | 64697.46763 |
| Median Absolute Deviation (MAD) | 46061 |
| Skewness | 0.4020604077 |
| Sum | 8992948 |
| Variance | 2746743313 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 87294 | 1 | 0.7% | |
| 1869 | 1 | 0.7% | |
| 7696 | 1 | 0.7% | |
| 52050 | 1 | 0.7% | |
| 56401 | 1 | 0.7% | |
| 91984 | 1 | 0.7% | |
| 152911 | 1 | 0.7% | |
| 118606 | 1 | 0.7% | |
| 8268 | 1 | 0.7% | |
| 6974 | 1 | 0.7% | |
| Other values (129) | 129 | 92.8% |
| Value | Count | Frequency (%) | |
| 71 | 1 | 0.7% | |
| 96 | 1 | 0.7% | |
| 127 | 1 | 0.7% | |
| 169 | 1 | 0.7% | |
| 229 | 1 | 0.7% |
| Value | Count | Frequency (%) | |
| 178009 | 1 | 0.7% | |
| 177964 | 1 | 0.7% | |
| 174660 | 1 | 0.7% | |
| 171821 | 1 | 0.7% | |
| 169034 | 1 | 0.7% |
| Distinct count | 139 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 795824.2733812949 |
|---|---|
| Minimum | 518 |
| Maximum | 2448856 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 556.0 B |
Quantile statistics
| Minimum | 518 |
|---|---|
| 5-th percentile | 4746.6 |
| Q1 | 138872 |
| median | 562066 |
| Q3 | 1344280.5 |
| 95-th percentile | 2223234.4 |
| Maximum | 2448856 |
| Range | 2448338 |
| Interquartile range (IQR) | 1205408.5 |
Descriptive statistics
| Standard deviation | 745635.8535 |
|---|---|
| Coefficient of variation (CV) | 0.9369352989 |
| Kurtosis | -0.7850624495 |
| Mean | 795824.2734 |
| Median Absolute Deviation (MAD) | 492220 |
| Skewness | 0.7122378858 |
| Sum | 110619574 |
| Variance | 5.55972826e+11 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 1315413 | 1 | 0.7% | |
| 1837899 | 1 | 0.7% | |
| 648280 | 1 | 0.7% | |
| 165973 | 1 | 0.7% | |
| 2391636 | 1 | 0.7% | |
| 209491 | 1 | 0.7% | |
| 2118323 | 1 | 0.7% | |
| 670364 | 1 | 0.7% | |
| 2197729 | 1 | 0.7% | |
| 8512 | 1 | 0.7% | |
| Other values (129) | 129 | 92.8% |
| Value | Count | Frequency (%) | |
| 518 | 1 | 0.7% | |
| 900 | 1 | 0.7% | |
| 1349 | 1 | 0.7% | |
| 2149 | 1 | 0.7% | |
| 2869 | 1 | 0.7% |
| Value | Count | Frequency (%) | |
| 2448856 | 1 | 0.7% | |
| 2423595 | 1 | 0.7% | |
| 2391636 | 1 | 0.7% | |
| 2362999 | 1 | 0.7% | |
| 2332495 | 1 | 0.7% |
| Distinct count | 17 |
|---|---|
| Unique (%) | 12.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 19.215827338129497 |
|---|---|
| Minimum | 0 |
| Maximum | 385 |
| Zeros | 117 |
| Zeros (%) | 84.2% |
| Memory size | 556.0 B |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 265.3 |
| Maximum | 385 |
| Range | 385 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 70.06692066 |
|---|---|
| Coefficient of variation (CV) | 3.646312981 |
| Kurtosis | 13.7433766 |
| Mean | 19.21582734 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.835709983 |
| Sum | 2671 |
| Variance | 4909.373371 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 117 | 84.2% | |
| 1 | 3 | 2.2% | |
| 2 | 3 | 2.2% | |
| 268 | 3 | 2.2% | |
| 270 | 1 | 0.7% | |
| 3 | 1 | 0.7% | |
| 265 | 1 | 0.7% | |
| 10 | 1 | 0.7% | |
| 385 | 1 | 0.7% | |
| 125 | 1 | 0.7% | |
| Other values (7) | 7 | 5.0% |
| Value | Count | Frequency (%) | |
| 0 | 117 | 84.2% | |
| 1 | 3 | 2.2% | |
| 2 | 3 | 2.2% | |
| 3 | 1 | 0.7% | |
| 10 | 1 | 0.7% |
| Value | Count | Frequency (%) | |
| 385 | 1 | 0.7% | |
| 350 | 1 | 0.7% | |
| 277 | 1 | 0.7% | |
| 270 | 1 | 0.7% | |
| 268 | 3 | 2.2% |
| Distinct count | 139 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 860521.7410071943 |
|---|---|
| Minimum | 589 |
| Maximum | 2626820 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 556.0 B |
Quantile statistics
| Minimum | 589 |
|---|---|
| 5-th percentile | 5149.1 |
| Q1 | 151506 |
| median | 624332 |
| Q3 | 1445385.5 |
| 95-th percentile | 2381798.6 |
| Maximum | 2626820 |
| Range | 2626231 |
| Interquartile range (IQR) | 1293879.5 |
Descriptive statistics
| Standard deviation | 797370.0081 |
|---|---|
| Coefficient of variation (CV) | 0.926612275 |
| Kurtosis | -0.7943924711 |
| Mean | 860521.741 |
| Median Absolute Deviation (MAD) | 542333 |
| Skewness | 0.6946421337 |
| Sum | 119612522 |
| Variance | 6.357989299e+11 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 543999 | 1 | 0.7% | |
| 498759 | 1 | 0.7% | |
| 48208 | 1 | 0.7% | |
| 81999 | 1 | 0.7% | |
| 223822 | 1 | 0.7% | |
| 589 | 1 | 0.7% | |
| 1428674 | 1 | 0.7% | |
| 122440 | 1 | 0.7% | |
| 2155237 | 1 | 0.7% | |
| 525881 | 1 | 0.7% | |
| Other values (129) | 129 | 92.8% |
| Value | Count | Frequency (%) | |
| 589 | 1 | 0.7% | |
| 996 | 1 | 0.7% | |
| 1476 | 1 | 0.7% | |
| 2318 | 1 | 0.7% | |
| 3098 | 1 | 0.7% |
| Value | Count | Frequency (%) | |
| 2626820 | 1 | 0.7% | |
| 2601604 | 1 | 0.7% | |
| 2566296 | 1 | 0.7% | |
| 2534820 | 1 | 0.7% | |
| 2501529 | 1 | 0.7% |
| Distinct count | 101 |
|---|---|
| Unique (%) | 72.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 207.6474820143885 |
|---|---|
| Minimum | 0 |
| Maximum | 381 |
| Zeros | 18 |
| Zeros (%) | 12.9% |
| Memory size | 556.0 B |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 160.5 |
| median | 217 |
| Q3 | 302 |
| 95-th percentile | 366.3 |
| Maximum | 381 |
| Range | 381 |
| Interquartile range (IQR) | 141.5 |
Descriptive statistics
| Standard deviation | 118.1399234 |
|---|---|
| Coefficient of variation (CV) | 0.5689446471 |
| Kurtosis | -0.8476325487 |
| Mean | 207.647482 |
| Median Absolute Deviation (MAD) | 75 |
| Skewness | -0.4692885604 |
| Sum | 28863 |
| Variance | 13957.0415 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 18 | 12.9% | |
| 206 | 2 | 1.4% | |
| 354 | 2 | 1.4% | |
| 356 | 2 | 1.4% | |
| 362 | 2 | 1.4% | |
| 373 | 2 | 1.4% | |
| 64 | 2 | 1.4% | |
| 165 | 2 | 1.4% | |
| 166 | 2 | 1.4% | |
| 302 | 2 | 1.4% | |
| Other values (91) | 103 | 74.1% |
| Value | Count | Frequency (%) | |
| 0 | 18 | 12.9% | |
| 24 | 1 | 0.7% | |
| 26 | 1 | 0.7% | |
| 27 | 1 | 0.7% | |
| 38 | 1 | 0.7% |
| Value | Count | Frequency (%) | |
| 381 | 1 | 0.7% | |
| 378 | 1 | 0.7% | |
| 376 | 1 | 0.7% | |
| 373 | 2 | 1.4% | |
| 370 | 1 | 0.7% |
| Distinct count | 117 |
|---|---|
| Unique (%) | 84.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1205.9208633093526 |
|---|---|
| Minimum | 0 |
| Maximum | 1964 |
| Zeros | 13 |
| Zeros (%) | 9.4% |
| Memory size | 556.0 B |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1063 |
| median | 1365 |
| Q3 | 1643.5 |
| 95-th percentile | 1840.6 |
| Maximum | 1964 |
| Range | 1964 |
| Interquartile range (IQR) | 580.5 |
Descriptive statistics
| Standard deviation | 590.9157112 |
|---|---|
| Coefficient of variation (CV) | 0.4900120142 |
| Kurtosis | -0.1094508173 |
| Mean | 1205.920863 |
| Median Absolute Deviation (MAD) | 282 |
| Skewness | -1.043136761 |
| Sum | 167623 |
| Variance | 349181.3777 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 13 | 9.4% | |
| 1777 | 2 | 1.4% | |
| 1341 | 2 | 1.4% | |
| 849 | 2 | 1.4% | |
| 1455 | 2 | 1.4% | |
| 1749 | 2 | 1.4% | |
| 1769 | 2 | 1.4% | |
| 117 | 2 | 1.4% | |
| 1763 | 2 | 1.4% | |
| 1210 | 2 | 1.4% | |
| Other values (107) | 108 | 77.7% |
| Value | Count | Frequency (%) | |
| 0 | 13 | 9.4% | |
| 49 | 1 | 0.7% | |
| 62 | 1 | 0.7% | |
| 66 | 1 | 0.7% | |
| 76 | 1 | 0.7% |
| Value | Count | Frequency (%) | |
| 1964 | 1 | 0.7% | |
| 1961 | 1 | 0.7% | |
| 1914 | 1 | 0.7% | |
| 1874 | 1 | 0.7% | |
| 1857 | 1 | 0.7% |
| Distinct count | 87 |
|---|---|
| Unique (%) | 62.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 107.26618705035972 |
|---|---|
| Minimum | 0 |
| Maximum | 252 |
| Zeros | 29 |
| Zeros (%) | 20.9% |
| Memory size | 556.0 B |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 41 |
| median | 107 |
| Q3 | 161 |
| 95-th percentile | 234.3 |
| Maximum | 252 |
| Range | 252 |
| Interquartile range (IQR) | 120 |
Descriptive statistics
| Standard deviation | 77.51489577 |
|---|---|
| Coefficient of variation (CV) | 0.722640544 |
| Kurtosis | -1.05635318 |
| Mean | 107.2661871 |
| Median Absolute Deviation (MAD) | 59 |
| Skewness | 0.07751885322 |
| Sum | 14910 |
| Variance | 6008.559066 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 29 | 20.9% | |
| 97 | 3 | 2.2% | |
| 30 | 3 | 2.2% | |
| 141 | 3 | 2.2% | |
| 88 | 3 | 2.2% | |
| 112 | 2 | 1.4% | |
| 96 | 2 | 1.4% | |
| 106 | 2 | 1.4% | |
| 92 | 2 | 1.4% | |
| 199 | 2 | 1.4% | |
| Other values (77) | 88 | 63.3% |
| Value | Count | Frequency (%) | |
| 0 | 29 | 20.9% | |
| 18 | 2 | 1.4% | |
| 30 | 3 | 2.2% | |
| 41 | 2 | 1.4% | |
| 43 | 1 | 0.7% |
| Value | Count | Frequency (%) | |
| 252 | 1 | 0.7% | |
| 250 | 1 | 0.7% | |
| 248 | 1 | 0.7% | |
| 244 | 1 | 0.7% | |
| 242 | 1 | 0.7% |
| Distinct count | 127 |
|---|---|
| Unique (%) | 91.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 31446.208633093524 |
|---|---|
| Minimum | 0 |
| Maximum | 90149 |
| Zeros | 13 |
| Zeros (%) | 9.4% |
| Memory size | 556.0 B |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2340.5 |
| median | 25387 |
| Q3 | 55557.5 |
| 95-th percentile | 82429.2 |
| Maximum | 90149 |
| Range | 90149 |
| Interquartile range (IQR) | 53217 |
Descriptive statistics
| Standard deviation | 29523.69785 |
|---|---|
| Coefficient of variation (CV) | 0.938863511 |
| Kurtosis | -1.236664188 |
| Mean | 31446.20863 |
| Median Absolute Deviation (MAD) | 24351 |
| Skewness | 0.4545167393 |
| Sum | 4371023 |
| Variance | 871648734.5 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 13 | 9.4% | |
| 36185 | 1 | 0.7% | |
| 33856 | 1 | 0.7% | |
| 4161 | 1 | 0.7% | |
| 46662 | 1 | 0.7% | |
| 67143 | 1 | 0.7% | |
| 72207 | 1 | 0.7% | |
| 8523 | 1 | 0.7% | |
| 76 | 1 | 0.7% | |
| 6347 | 1 | 0.7% | |
| Other values (117) | 117 | 84.2% |
| Value | Count | Frequency (%) | |
| 0 | 13 | 9.4% | |
| 31 | 1 | 0.7% | |
| 76 | 1 | 0.7% | |
| 107 | 1 | 0.7% | |
| 121 | 1 | 0.7% |
| Value | Count | Frequency (%) | |
| 90149 | 1 | 0.7% | |
| 88412 | 1 | 0.7% | |
| 87249 | 1 | 0.7% | |
| 86157 | 1 | 0.7% | |
| 84981 | 1 | 0.7% |
| Distinct count | 1 |
|---|---|
| Unique (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 556.0 B |
| 0 |
|---|
| Value | Count | Frequency (%) | |
| 0 | 139 | 100.0% |
| Distinct count | 139 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 60620.10791366907 |
|---|---|
| Minimum | 16 |
| Maximum | 173907 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 556.0 B |
Quantile statistics
| Minimum | 16 |
|---|---|
| 5-th percentile | 118.2 |
| Q1 | 10100.5 |
| median | 55400 |
| Q3 | 98561.5 |
| 95-th percentile | 153699.2 |
| Maximum | 173907 |
| Range | 173891 |
| Interquartile range (IQR) | 88461 |
Descriptive statistics
| Standard deviation | 51475.88339 |
|---|---|
| Coefficient of variation (CV) | 0.8491552583 |
| Kurtosis | -0.9808194556 |
| Mean | 60620.10791 |
| Median Absolute Deviation (MAD) | 44972 |
| Skewness | 0.4452803553 |
| Sum | 8426195 |
| Variance | 2649766571 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 92159 | 1 | 0.7% | |
| 86860 | 1 | 0.7% | |
| 132952 | 1 | 0.7% | |
| 123735 | 1 | 0.7% | |
| 148054 | 1 | 0.7% | |
| 153427 | 1 | 0.7% | |
| 78671 | 1 | 0.7% | |
| 14158 | 1 | 0.7% | |
| 121752 | 1 | 0.7% | |
| 8510 | 1 | 0.7% | |
| Other values (129) | 129 | 92.8% |
| Value | Count | Frequency (%) | |
| 16 | 1 | 0.7% | |
| 23 | 1 | 0.7% | |
| 32 | 1 | 0.7% | |
| 42 | 1 | 0.7% | |
| 56 | 1 | 0.7% |
| Value | Count | Frequency (%) | |
| 173907 | 1 | 0.7% | |
| 170863 | 1 | 0.7% | |
| 167853 | 1 | 0.7% | |
| 164802 | 1 | 0.7% | |
| 161826 | 1 | 0.7% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, which implies that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
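As a minimal sketch of that calculation, the function below divides the covariance by the product of the standard deviations using only the standard library. The function name `pearson_r` is illustrative, not part of pandas-profiling; the sample data are the first ten df_index and tests_positive values from the "First rows" table.

```python
from math import sqrt

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # The 1/(n-1) factors in the covariance and both variances cancel,
    # so plain sums of products are enough.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)

# First ten rows of df_index and tests_positive from this report.
df_index = [6, 7, 8, 9, 10, 11, 12, 13, 14, 15]
tests_positive = [71, 96, 127, 169, 229, 306, 344, 409, 473, 565]

r = pearson_r(df_index, tests_positive)
# Shifting and (positively) rescaling one variable leaves r unchanged.
r_shifted = pearson_r(df_index, [2 * v + 3 for v in tests_positive])
print(r, r_shifted)
```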
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
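A rough sketch of this procedure: rank each variable, then apply the covariance-over-standard-deviations formula to the ranks. The helper names `average_ranks` and `spearman_rho` are illustrative, and assigning tied values their average rank is an assumption (a common convention, not something this report states).

```python
from math import sqrt

def average_ranks(values):
    """1-based ranks; tied values share the average of their positions (assumed convention)."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def spearman_rho(x, y):
    """Covariance of the rank variables divided by the product of their standard deviations."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / sqrt(var_x * var_y)

# A monotonic but strongly nonlinear relationship.
x = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
y = [v ** 3 for v in x]
print(spearman_rho(x, y))
```

Because only the ordering of the values matters, cubing x does not lower ρ, whereas Pearson's r on the same data would fall below 1.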
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
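The pair-counting definition can be sketched directly. The formula as written above, with no correction for ties, corresponds to the variant usually called tau-a; statistical libraries often apply a tie-corrected variant instead, so results may differ when the data contain ties. The function name `kendall_tau_a` is illustrative.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant - discordant) / total pairs, with no tie correction."""
    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:      # both variables move the same way
            concordant += 1
        elif direction < 0:    # the variables move in opposite ways
            discordant += 1
        # direction == 0 means a tie in x or y; it counts toward neither.
    n = len(x)
    total_pairs = n * (n - 1) / 2
    return (concordant - discordant) / total_pairs

# Three observations: two concordant pairs, one discordant pair.
print(kendall_tau_a([1, 2, 3], [1, 3, 2]))
```

This brute-force version compares every pair, so its cost grows quadratically with the number of observations; that is fine for the 139 rows here.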
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.
First rows
| | df_index | tests_positive | tests_negative | tests_pending | tests | patients_icu | patients_hosp | patients_vent | recovered | growth_rate | rolling_ave |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 6 | 71 | 518 | 49 | 589 | 0 | 0 | 0 | 0 | 0 | 16 |
| 1 | 7 | 96 | 900 | 52 | 996 | 0 | 0 | 0 | 0 | 0 | 23 |
| 2 | 8 | 127 | 1349 | 17 | 1476 | 0 | 0 | 0 | 0 | 0 | 32 |
| 3 | 9 | 169 | 2149 | 10 | 2318 | 0 | 0 | 0 | 0 | 0 | 42 |
| 4 | 10 | 229 | 2869 | 0 | 3098 | 0 | 0 | 0 | 0 | 0 | 56 |
| 5 | 11 | 306 | 3719 | 35 | 4025 | 0 | 0 | 0 | 0 | 0 | 72 |
| 6 | 12 | 344 | 4257 | 350 | 4601 | 0 | 0 | 0 | 0 | 0 | 93 |
| 7 | 13 | 409 | 4801 | 385 | 5210 | 0 | 0 | 0 | 0 | 0 | 121 |
| 8 | 14 | 473 | 6632 | 270 | 7105 | 0 | 0 | 0 | 0 | 0 | 160 |
| 9 | 15 | 565 | 7619 | 268 | 8184 | 0 | 0 | 0 | 0 | 0 | 208 |
Last rows
| | df_index | tests_positive | tests_negative | tests_pending | tests | patients_icu | patients_hosp | patients_vent | recovered | growth_rate | rolling_ave |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 129 | 135 | 152911 | 2156141 | 0 | 2309052 | 253 | 1455 | 131 | 79961 | 0 | 148054 |
| 130 | 136 | 156300 | 2197729 | 0 | 2354029 | 302 | 1455 | 137 | 80997 | 0 | 150725 |
| 131 | 137 | 158253 | 2220028 | 0 | 2378281 | 297 | 1472 | 139 | 82315 | 0 | 153427 |
| 132 | 138 | 161365 | 2252092 | 0 | 2413457 | 302 | 1480 | 145 | 83457 | 0 | 156149 |
| 133 | 139 | 164795 | 2292128 | 0 | 2456923 | 292 | 1496 | 146 | 84187 | 0 | 158950 |
| 134 | 140 | 169034 | 2332495 | 0 | 2501529 | 278 | 1647 | 210 | 84981 | 0 | 161826 |
| 135 | 141 | 171821 | 2362999 | 0 | 2534820 | 285 | 1964 | 211 | 86157 | 0 | 164802 |
| 136 | 142 | 174660 | 2391636 | 0 | 2566296 | 290 | 1961 | 212 | 87249 | 0 | 167853 |
| 137 | 143 | 178009 | 2423595 | 0 | 2601604 | 262 | 1857 | 215 | 88412 | 0 | 170863 |
| 138 | 144 | 177964 | 2448856 | 0 | 2626820 | 271 | 1640 | 167 | 90149 | 0 | 173907 |